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ENTERED 



3 <110> APPLICANT: Sano, Hiroshi 

4 - Kusano, Tomonobu 

5 Koizumi, Nozomu 

7 <120> TITLE OF INVENTION: Theobromine Synthase Polypeptide of Coffee Plant and the 
Gene , 

8 Encoding Said Polypeptide 
10 <130> FILE REFERENCE:' 026350-068 

12 <140> CURRENT APPLICATION 'NUMBER : .US 09/971,020 

C--> 13 <141> CURRENT FILING DATE: 2001-12-18 

15 <150> PRIOR APPLICATION NUMBER: JP 2000-307,149 

16 <151> PRIOR FILING DATE: 2000-10-06 
18 <160> NUMBER OF SEQ ID NOS : 8 

20 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 378 ' ' 

24 <212> TYPE: PRT 

25 <213> ORGANISM: Caffea arabica 

27 <400> SEQUENCE: 1 

28 Met Glu Leu Gin Glu Val Leu His Met Asn Glu Gly Glu Gly Asp Thr 

29 1 5 10 15 

30 Ser Tyr Ala Lys Asn Ala Ser Tyr Asn Leu Ala Leu Ala Lys Val Lys 

31 20 25 30 

32 Pro Phe Leu Glu Gin Cys lie Arg Glu Leu Leu Arg Ala Asn Leu Pro 

33 35 '40 45 

34 Asn lie Asn Lys Cys -lie Lys Val Ala Asp Leu Gly Cys Ala Ser Gly 

35 50 55 60 

36 Pro Asn Thr Leu Leu Thr Val Arg Asp lie Val Gin Ser lie Asp Lys 

37 65 ' 70 75 80 

38 Val Gly Gin Glu Glu Lys Asn Glu Leu Glu Arg Pro Thr He Gin He 

39 85 .90 95 

40 Phe Leu Asn Asp Leu Phe Gin Asn Asp Phe Asn Ser Val Phe Lys Leu 

41 100 105 110 

42 Leu Pro Ser Phe Tyr Arg Lys Leu Glu Lys Glu Asn Gly Arg Lys lie 

43 115 ' 120 125 

44 Gly Ser Cys Leu He Ser Ala Met Pro Gly Ser Phe Tyr Gly Arg Leu 

45 130 135 140 

46 Phe Pro Glu Glu Ser Met His Phe Leu His Ser Cys Tyr Ser Val His 

47 145 150 155 160 

48 Trp Leu Ser Gin Val Pro Ser Gly Leu Val He Glu Leu Gly He Gly 

49 165 170 175 

50 Ala Asn Lys Gly Ser He Tyr Ser Ser Lys Gly Cys Arg Pro' Pro Val 

51 180 185 190 

52 Gin Lys Ala Tyr Leu Asp Gin Phe Thr Lys Asp Phe Thr Thr Phe Leu 

53 • 195 200 205 

54 Arg He His Ser Lys Glu Leu Phe Ser Arg Gly Arg Met Leu Leu Thr 

55 210 215 220 

56 Cys lie Cys Lys Val Asp Glu Phe Asp Glu Pro Asn Pro Leu Asp Leu 

57 225 230 . 235 240 
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58 


Leu 


Asp 


Met 


Ala 


He 


As n 


Asp 


Leu 


He 


Val 


Glu 


Gly 


Leu 


Leu 


Glu 


Glu 


59 










245 










250 










255 




60 


Glu 


Lys 


Leu 


Asp 


Ser 


Phe 


len 
no ii 


Tl p 




Php 


Php 


Thr 




Cp>- 
OCi 


Al a 




61 








260 










265 










270 






62 


Glu 


Val 


Lys 


Cys 


He 


Val 


Glu 


Glu 


Glu 


Gly • 


Ser 


Cys 


Glu 


He 


Leu 


Tvr 

xyr 


63 






275 










280 










285 








64 


Leu 


Glu 


Thr 


Phe 


Lys 


Ala 


His 


Tyr 


Asp 


Ala 


Ala 


Phe 


Ser 


lie 


Asp 


Asp 


65 




290 










295 










300 










66 


Asp Tyr 


Pro 


Val 


Arg 


Ser 


His 


Glu 


Gin 


He 


Lys 


Ala 


Glu 


Tyr 


Val 


Ala 


67 


305 










310 










315 










320 


68 


Ser 


Leu 


lie Arg 


Ser 


Val 


Tyr 


Glu 


Pro 


He 


Leu 


Ala 


Ser 


His 


Phe 


Gly 


69 










325 










330 










335 




70 


Glu 


Ala 


He 


Met 


Pro, 


Asp 


Leu 


Phe 


His 


Arg 


Leu 


Ala 


Lys 


His 


Ala 


Ala 


71 








340 










345 










350 






72 


Lys 


Val 


Leu 


His 


Met 


Gly Lys 


Gly 


Cys 


Tyr 


Asn 


Asn 


Leu 


He 


He 


Ser 


73 






355 










360 










365 








74 


Leu 


Ala 


Lys 


Lys 


Pro 


Glu 


Lys 


Ser 


Asp 


Val 















75 . 370 375 

78 <210> SEQ ID NO : 2 

79 <211> LENGTH: 1298 

80 <212> TYPE: DNA 

81 <213> ORGANISM: Caf f ea . arabica 

83 <4 00> ' SEQUENCE:' 2 

84 ageagtcgea attcgattgt ectgeatatg aatggagctc caagaagtcc tgcatatgaa 60 

85 tgaaggtgaa ggcgatacaa gctacgccaa gaatgeatec tacaatctgg ctcttgccaa 120 

86 ggtgaaacct ttccttgaac. aatgcatacg agaattgttg cgggccaact tgcccaacat 180 

87 caacaagtgc attaaagttg cggatttggg atgegcttet ggaccaaaca cacttttaac 240 
88. agtgcgggac attgtgcaaa gtattgacaa agttggccag gaagagaaga atgaattaga 300 

89 acgtcccacc attcagattt ttctgaatga tcttttccaa aatgatttca attcggtttt 360 

90 caagttgctg ccaagcttct accgcaaact cgagaaagaa aatggacgca agataggatc 420 

91 gtgectaata agegcaatge ctggctcttt etaeggcaga ctcttccccg aggagtccat 480 

92 gcattttttg cactcttgtt acagtgttca ttggttatct caggttccca gcggtttggt 540 

93 gattgaattg gggattggtg caaacaaagg gagtatttac tcttccaaag gatgtegtec 600 

94 gcccgtccag aaggcatatt tggatcaatt tacgaaagat tttaccacat ttctaaggat 660 

95 tcattcgaaa gagttgtttt cacgtggccg aatgetcett acctgeattt gtaaagtaga 72 0 

96 tgaattcgac gaaccgaatc ccctagactt acttgacatg gcaataaacg acttgattgt 780 

97 tgagggactt ctggaggaag aaaaattgga tagtttcaat attccattct ttacaccttc 840 

98 agcagaagaa gtaaagtgca tagttgagga ggaaggttct tgcgaaattt tatatctgga 900 

99 gacttttaag geccattatg atgetgeett ctctattgat gatgattacc cagtaagatc 960 

100 ccatgaacaa attaaagcag agtatgtggc atcattaatt agatcagttt acgaacccat 1020 

101 cctcgcaagt cattttggag aagctattat gectgactta ttccacaggc ttgegaagea 1080 

102 tgcagcaaag gttctccaca tgggcaaagg ctgctataat aatcttatca tttctctcgc 1140 

103 caaaaageca gagaagtcag acgtgtaaaa gtttgttttt agttggtttt tgtgccgttg 1200 

104 ggggtctttc gggtattgtc gttttgtatt cgtaataaaa gtgatgtgca agaataagat 1260 

105 atttagtaca atattttcat aaaaaaaaaa aaaaaaaa 1298 

107 <210> SEQ ID NO: 3 

108 <211> LENGTH: 385 

109 <212> TYPE: PRT 

110 <213> ORGANISM: Caff ea arabica 
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119 

XXL 


<4 00> SEQUENCE: 


■3 
J 






















I 1 T 

II J 


i ic u 
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Leu 
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jyie L 
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Gly Glu Gly Glu Ala 


1 1 A 
ll*t 


]_ 
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i n 

x U 








15 




1 1 c 
x x D 


Ser 


Tvr 
x i ■ L 


Ala 


Lys 


Asn 


■ 
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rnc 
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\j±n 


Leu 


Val 


Leu Ala 


Lys 


Val 


116 








20 










25 








30 






117 

XX f 


Lys 


Pro 


Val 


Leu 


Glu 


Gin 




v a x 


Arg 




Leu 


Leu 


Arg Ala 


Asn 


Leu 


118 






35 










40 










45 






1 1 Q 

X X-s 


Pro 


Asn 


lie 


Asn 


Lys 


Cys 


X X c 


Lys 


V a J. 


2V1 a 


Asp 


Leu 


Gly Cys 


Ala 


Ser 


1 9f) 




50 




















60 








1 91 

X £ X 


Glv 


Pro 


Asn 


Thr 


Leu 


Leu 


±111. 


Val 
v ai 


irp 


Asp 


J. ilX 


Val 


Gin Ser 


lie 


Asp 


1 99 

X£i £i 


65 










70 










7 R 








80 


123 


Lys 


Val 


Lys 


Gin 


Glu 


Met 


Lys ■ 


Asn 


Glu 


Leu 


Glu 


Arg 


Pro Thr 


lie 


Gin 


124 










85 










.7 V 








95 




1 9 R 


Val 


Phe 


Leu 


Thr 


Asp 


Leu 


Php 

IT JIG 


n n 


Asn 


Asp 


IT IlC 


Asn 


Ser Val 


Phe Met 


1 96 

1ZD 








100 










i nc 

1UJ 








110 






19 7 


Leu 


Leu 


Pro 


Ser 


Phe Tyr 


Arg 


Lys 


Leu 


Pin 
olU 


Lys 


Glu Asn Gly Arg Lys 


1 9 P 

J. .6 0 






1 1 s 

XX J 










1 9fi 
1 Z U 










125 






1 9 Q 
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Gly 


Ser 


Cys 
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He 


Ala 


Ala 
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Ser 
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Gly Arg 






i to 
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140 








1 ol 
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Glu 
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Ser 
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Leu 


1 "59 
Ijz 


145 
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Ala 
Ala 


Glu Gly Arg 


Leu 


Gly 
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9 SD 
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Glu 


Lys 
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260' 










96 S 

£i O J 
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147 


Glu 
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Val 
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Va 1 
val 
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vlU 
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Phe Glu 


He 
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97 S 
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1 A Q 

i*t y 
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Phe 
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; 

Tyr 
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Asp. 


150 
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151 


Asp 


Asp 
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Gin 


Val- 


Arg 


Ser 


His 


Ser 


Pro 


Val 


Tyr 


Ser Asp Glu 


His 
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305 










310 










315 
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Arg 
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His 


Val 
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Leu 


He 


Arg 


Ser 


Val Tyr 


Glu 


Pro 
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330 








335 
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Leu 


Ala 


Ser 


His' 


Phe 


Gly 


Glu 


Ala 


He, 


lie 


Pro Asp lie 


Phe 


His 


156 








340 










345 








B 350 
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Arg 


Phe 


Ala 


Thr 


Asn 


Ala 


Ala 


Lys 


Val 


lie 


Arg 


Leu Gly Lys 


Gly 


Phe 


158 






355 










360 










365 
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161 He . 

162 385 • 

165 <210> SEQ ID NO; 4 

166 <211> LENGTH: 1360 . ■ . 

167 <212> TYPE: DNA ' 

168 <213> ORGANISM:. Caffea arabica 

170 <4 00> SEQUENCE: 4 , . 

171 gtcctgcata ' tgaatggagc tccaagaagt cctgcatatg aatggaggcg aaggcgaagc 60 

172 aagctacgcc aagaattcat ccttcaatca actggttctc gccaaggtga aacctgtcct 120 

173 tgaacaatgc gtacgggaat tgttgcgg.gc caacttgccc aacatcaaca agtgcattaa 180 

174 agttgcagat ttgggatgcg cttccggacc aaacacactt ttaaccgttt gggacactgt 240 

175 acaaagtatt gacaaagtta agcaagaaat gaagaatgaa ttagaacgtc ccaccattca 300 

176 ggtttttctg actgatcttt tccaaaatga tttcaattcg gttttcatgc tgctgccaag 360 

177 cttctaccgc aaacttgaga aagaaaatgg acgcaaaata ggatcgtgcc taatagccgc 420 

178 aatgcctggc tctttccacg gcagactctt ccccgaggag tccatgcatt ttttacactc 480 

179 ttcttacagt cttcagtttt tatcccaggt tcccagcggt ttggtgactg aattggggat 540 
,180 cactgcgaac aaaaggagca tttactcttc caaagcaagt cctccgcccg tccagaaggc 600 

181 atatttggat caatttacga aagattttac cacattttta aggatgcgtt cggaagagtt* 660 

182 gctttcacgt ggccgaatgc tccttacttg catttgtaaa ggagatgaat gcgacggccc 720 

183 gaataccatg gacttacttg- agatggcaat aaacgacttg gttgctgagg gacgtctggg 780 

184 ggaagaaaaa ttggacagtt tcaatgttcc aatctataca gcttcagtag aagaagtaaa 840 
185 1 gtgcatggtt gaggaggaag gttcttttga aattttatac ttgcagactt ttaagctccg 900 

186 ttatgatgct ggcttctcta ttgatgatga ttgccaagta ' agatcccatt ccccagtata 960 

187 cagcgatgaa catgctagag cagcgcatgt ggcatcatta attagatcag tttacgaacc 1020 
; 188 catcctagca.agtcattttg gagaagctat tatacctgac atattccaca ggtttgcgac 1080 

189 gaatgcagca aaggttatcc gcttgggcaa aggcttctat aataatctta tcatttctct 1140 

190 tgccaaaaaa ccagagaagt cagacatata aaagcttgtt tttagttggt ttttgtgtta 1200 

191 tgggttgttt tctgatacgg ggaaaggatt cagtgcggtt ggggttctat ccgagtattg 1260 

192 tactttttat attattagtt ggtgtataat tattatgtta cattgttata ttcgtaataa 1320 

193 aagtgacgta caaaaataaa atattttcat aaaaaaaaaa 1360 

195 <210> SEQ ID NO: 5 

196 <211> LENGTH': 385 

197 <212> TYPE': PRT 

198 <213> ORGANISM: Caffea arabica 
<400> SEQUENCE: 5- 
Met Glu Leu Gin 

202 1 

203 Ser Tyi 
204- 

205 Lys Pre 
206 

207 Pro Asr 

208 50 

209 Gly Pre 

210 65 

211 Asp Va] 

212 * 

213 Val Phe 
214 



Leu 


Gin 


Glu 


Val 


Leu 


His 


Met 


Asn Gly Gly Glu Gly Asp 


Ala 






5 










10 


15 




Ala 


Lys 


Asn 


Ser 


Ser 


Phe 


Asn 


Gin Leu Val Leu Ala 


Lys 


Val 




20 










25 


' 30 






Val 


Leu 


Glu 


Gin 


Cys 


Val 


Gly 


Glu Leu Leu Arg. Ala 


Asn 


Leu 


35 • 










40 




45 


Ala 




He 


Asn 


Lys 


Cys 


He 


Lys 


Val 


Ala Asp Leu Gly Cys 


Ser 










55 . 






60 






Asn 


Thr 


Leu 


Leu 


Thr 


Val 


Arg 


Asp lie Val Gin Ser 


He 


Asp 








70 








75 




80 


Arg 


Gin 


Glu 


Met 


Lys 


Asn 


Glu 


Leu Glu Arg Pro Thr 


lie Gin 




85 










90 


95 




Leu 


Thr 


Asp 


Leu 


Phe 


Gin 


Asn 


Asp Phe Asn Ser Val 


Phe 


Met 




100 








105 


110 
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215 Leu Leu Pro Ser Phe Tyr Arg Lys Leu Glu Lys Glu Asn Gly. Arg. Lys 

216 115 120 • 125 

217 He Gly Ser Cys Leu He Ala Ala Met Pro Gly Ser Phe His Gly Arg 

218 130 135 140 

219 Leu Phe Pro Glu Glu Ser Met His Phe Leu His .Ser Ser Tyr Ser Leu . 

220 145 , . 150 155 , 160 

221 Gin Phe Leu Ser Gin Val Pro Ser Gly Leu Val Thr Glu Leu Gly lie 

222 165 170 175. 

223 Thr Ala Asn Lys Arg Ser lie Tyr Ser Ser Lys Ala Ser Pro -Pro Pro 

224 180, 185 190 

225 Val Gin Lys Ala Tyr Leu Asp Gin Phe Thr'. Lys Asp Phe Thr Thr Phe 

226 195 ■ 200 205 

227 Leu Arg He Arg Ser Glu Glu Leu Leu Ser Arg Gly Arg Met Leu Leu 

228 ' 210 215 220 

22.9"Thr Cys lie Cys Lys Gly Asp Glu Phe Asp Gly Pro Asn Thr Met Asp 

230 225 ■ 230 235 240 

231 Leu Leu Glu Met Ala He Asn Asp Leu Val Val Glu Gly His Leu Glu 

232 245 250 255 

233 Glu Glu Lys Leu Asp Ser Phe Asn Val Pro He Tyr Ala; Ala Ser Val 

234 260. . 265 270 . 

235 Glu Glu Leu Lys Cys lie Val Glu Glu Glu Gly Ser Phe Glu lie Leu 

236 275 280 285 

237 Tyr Leu Glu Thr Phe' Lys Leu Arg Tyr Asp Ala Gly. Phe Ser He Asp 

238 290 ' 295 300 > ' 

239 Asp Asp Cys Gin Val Arg Ser His, Ser Pro Glu Tyr Ser Asp Glu His 

240 305 310 . 315 320 

241 Ala Arg Ala Ala- His. Val Ala Ser Leu Leu Arg Ser Val Tyr Glu Pro 

242 ' 325 330 335 

24 3 lie- Leu Ala Asn His Phe Gly Glu Ala lie He Pro Asp He Phe His 

244 340' ' 345 350 

245 Arg Phe Ala Thr Asn Ala Ala Lys Val He Arg Leu Gly Lys Gly Phe 

246 355 - 360 365 

,247 Tyr Asn Asn Leu He He Ser Leu Ala Lys Lys Pro Glu Lys Ser Asp 

248 370 " 375 380 

249 He 1 . / . 1 

250 385 , ; 

253 <210> SEQ ID NO: 6 

254 <211> LENGTH: 1304 

255 <212> TYPE: DNA . 

256 <213> ORGANISM: Caffea arabica 

258 <400> SEQUENCE: 6 

259 tttagcagtc ccaattcgat ttatgtacaa gtcctgcata tgaatggagc tccaagaagt 60 

260 cctgcatatg aatggaggcg aaggcgatgc aagctacgcc aagaattcat ccttcaatca 120 

261 actggttctc gccaaggtga aacctgtcct tgaacaatgc gtaggggaat tgttgcgggc 180 

262 caacttgccc aacatcaaca agtgcattaa agttgcggat ttgggatgcg cttccggacc 240 
263'aaacacactt ttaacagttc gggacattgt acaaagtatt gacaaagtta ggcaagaaat 300 

264 gaagaatgaa ttagaacgtc ccaccattca ggtttttctg actgatcttt tccaaaatga 360 

265 tttcaattcg gttttcatgt tgctgccaag tttctaccgc aaacttgaga aagaaaatgg 420 

266 acgcaagata ggatcgtgcc taatagccgc aatgcctggc tctttccacg gcagactctt 480 
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